Extending the Applicability and Improving the Performance of Runtime Parallelization
نویسندگان
چکیده
When static analysis of a sequential loop fails to yield reliable information on its dependence structure, a parallelizing compiler is left with three alternatives: it can take the conservative option of emitting code for a sequential execution; it can optimistically emit code to speculatively execute the loop as a DOALL [6, 7]; or it can emit inspector-executor code to determine the actual dependence structure at runtime and to respect it in a parallel execution [8, 9]. The rst approach is certain to yield a slow execution. The second approach gives very good results when the loop can in fact be executed as a DOALL, but is of no help otherwise. In this paper we concentrate on the nal approach, runtime parallelization through the inspectorexecutor method. We have two goals in this work. The rst is to expand the class of loop to which the approach may be applied by removing restrictions on the loop dependence structures that it can handle. To achieve this goal, we introduce new forms of the inspector and executor that together remove all restrictions on the loop dependence structure. Thus, we show how to parallelize a class of loop that previously would have compelled the compiler to emit sequential code. Our second goal is to improve the performance of the inspector-executor approach through specializations applicable when static analysis yields some (weak) information about the array indexing functions used in assignments. We validate our work through a set of examples designed to illustrate the fundamental performance tradeo s characterizing the specialized implementations, using results taken from executions on 32 processors of a KSR1.
منابع مشابه
Improving the RX Anomaly Detection Algorithm for Hyperspectral Images using FFT
Anomaly Detection (AD) has recently become an important application of target detection in hyperspectral images. The Reed-Xialoi (RX) is the most widely used AD algorithm that suffers from “small sample size” problem. The best solution for this problem is to use Dimensionality Reduction (DR) techniques as a pre-processing step for RX detector. Using this method not only improves the detection p...
متن کاملAdaptive Loop Tiling for a Multi-cluster CMP
Loop tiling is a fundamental optimization for improving data locality. Selecting the right tile size combined with the parallelization of loops can provide additional performance increases in the modern of Chip MultiProcessor (CMP) architectures. This paper presents a runtime optimization system which automatically parallelizes loops and searches empirically for the best tile sizes on a scalabl...
متن کاملPreventing Key Performance Indicators Violations Based on Proactive Runtime Adaptation in Service Oriented Environment
Key Performance Indicator (KPI) is a type of performance measurement that evaluates the success of an organization or a partial activity in which it engages. If during the running process instance the monitoring results show that the KPIs do not reach their target values, then the influential factors should be identified, and the appropriate adaptation strategies should be performed to prevent ...
متن کاملExtending a Task Farming Framework with Dependences to P2P Communications
Grid computing is becoming more and more popular, but contrary to earlier expectations the development of applications for Grid environments is still not possible without expert knowledge in the field of parallelization. Therefore, recent work [14, 4] has combined automatic loop parallelization (in the polytope model) with component-based Grid programming: First, automatic parallelization techn...
متن کاملImpact of linear dimensionality reduction methods on the performance of anomaly detection algorithms in hyperspectral images
Anomaly Detection (AD) has recently become an important application of hyperspectral images analysis. The goal of these algorithms is to find the objects in the image scene which are anomalous in comparison to their surrounding background. One way to improve the performance and runtime of these algorithms is to use Dimensionality Reduction (DR) techniques. This paper evaluates the effect of thr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995